Chapter VIII Modeling and Synthesis of Realistic Visual Speech in 3 D

نویسندگان

  • Gregor A. Kalberer
  • Luc Van Gool
چکیده

The problem of realistic face animation is a difficult one. This is hampering a further breakthrough of some high-tech domains, such as special effects in the movies, the use of 3D face models in communications, the use of avatars and likenesses in virtual reality, and the production of games with more subtle scenarios. This work attempts to improve on the current stateof-the-art in face animation, especially for the creation of highly realistic lip and speech-related motions. To that end, 3D models of faces are used and based on the latest technology speech-related 3D face motion will be learned from examples. Thus, the chapter subscribes to the surging field of image-based modelling and widens its scope to include animation. The exploitation of detailed 3D motion sequences is quite unique, thereby Modeling and Synthesis of Realistic Visual Speech in 3D 267 Copyright © 2004, Idea Group Inc. Copying or distributing in print or electronic forms without written permission of Idea Group Inc. is prohibited. narrowing the gap between modelling and animation. From measured 3D face deformations around the mouth area, typical motions are extracted for different “visemes.” Visemes are the basic motion patterns observed for speech and are comparable to the phonemes of auditory speech. The visemes are studied with sufficient detail to also cover natural variations and differences between individuals. Furthermore, the transition between visemes is analysed in terms of co-articulation effects, i.e., the visual blending of visemes as required for fluent, natural speech. The work presented in this chapter also encompasses the animation of faces for which no visemes have been observed and extracted. The “transplantation” of visemes to novel faces for which no viseme data have been recorded and for which only a static 3D model is available allows for the animation of faces without an extensive learning procedure for each individual.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Realistic Face Animation for a Czech Talking Head

This paper is focused on improving visual Czech speech synthesis. Our aim was the design of a highly natural and realistic talking head with a realistic 3D face model, improved co-articulation, and a realistic model of inner articulatory organs (teeth, the tongue and the palate). Besides very good articulation our aim was also expression of the mimic and emotions of the talking head. The intell...

متن کامل

Visual speech synthesis from 3D video

Data-driven approaches to 2D facial animation from video have achieved highly realistic results. In this paper we introduce a process for visual speech synthesis from 3D video capture to reproduce the dynamics of 3D face shape and appearance. Animation from real speech is performed by path optimisation over a graph representation of phonetically segmented captured 3D video. A novel similarity m...

متن کامل

Constructing Physically Realistic VCV Stimuli for the Perception of Stop Voicing in European Portuguese

In this book chapter we present the generation of physically realistic stimuli with a biomechanical speech production model, with the aim to produce perceptually appropriate VCV sets for the European Portuguese (EP) voicing distinction. The duration measures necessary for the biomechanical model were extracted from an extensive EP speech production database, recorded for this aim. The same data...

متن کامل

The KTH 3D Vocal Tract project (Engwall, 1999) aims at realistic modeling of the intraoral articulator movement in speech, using a rule-based approach to visual speech synthesis

movement in speech, using a rule-based approach to visual speech synthesis (Beskow, 1995). The hope is that a realistic 3D model of the tongue, made visible in the frame of a synthetic face (Lundeberg and Beskow, 1999), as shown in Fig. 1, can be of use in pronunciation training to provide visual feedback to eg. hearing-impaired children. In the current state of the project, the model consists ...

متن کامل

Image-based Talking Heads using Radial Basis Functions

In recent years talking heads have received a great deal of interest, both in their application to natural humancomputer dialogue, and their benefit to the intelligibility of synthesised speech. A model for the realistic synthesis of visual speech animation is described in this paper. Images representing the key visual speech poses (visemes) are pre-recorded and labelled. Transitions between vi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003